Comparisons of Non-Gaussian Statistical Models in DNA Methylation Analysis
Abstract
DNA methylation is a key regulatory mechanism of gene expression, and its patterns are widely altered in many complex genetic diseases, including cancer. DNA methylation levels are naturally quantified as data with bounded support and are therefore non-Gaussian distributed. To capture these properties, we introduce several non-Gaussian statistical models to perform dimension reduction on DNA methylation data. Non-Gaussian statistical model-based unsupervised clustering strategies are then applied to cluster the data. We present comparisons and analysis of the different dimension reduction strategies and unsupervised clustering methods. Experimental results show that the non-Gaussian statistical model-based methods are superior to the conventional Gaussian distribution-based method, making them meaningful tools for DNA methylation analysis. Moreover, among the non-Gaussian methods, the one that captures the bounded nature of DNA methylation data achieves the best clustering performance.
Similar resources
Predicting CpG Islands and DNA Methylation in the Cow Genome Using DNA Microarray Meta-Analysis and Genome Wide Scanning
DNA methylation is a type of epigenetic change that directly affects DNA. In mammals, DNA methylation is essential for fetal development and stem cell differentiation, and it occurs mainly within CpG islands. In this study, two methods were used to study the DNA methylation profile of the cow genome. In the first method, the DNA methylation profile of the differentially expres...
Parameter Estimation in Spatial Generalized Linear Mixed Models with Skew Gaussian Random Effects using Laplace Approximation
Spatial generalized linear mixed models are commonly used for modelling non-Gaussian discrete spatial responses. We present an algorithm for estimating the parameters of these models using a Laplace approximation of the likelihood function. In these models, the spatial correlation structure of the data is captured by random effects or latent variables. In most spatial analysis, it is assumed that rando...
P-128: The Effect of DNA Methyl Transferase1 Inhibitor (RG108) on DNA Methylation Pattern of Cloned Mouse Embryos
Background: In the somatic cell nuclear transfer (SCNT) method of cloning, the transferred nucleus should be dedifferentiated from a differentiated state to an embryonic state. Molecular analysis has shown that reprogramming of the transferred nucleus is incomplete and that cloned embryos have epigenetic abnormalities such as high DNA methylation levels. Since methylation in pre-implantation embryos has a cri...
Conditional Dependence in Longitudinal Data Analysis
Mixed models are widely used to analyze longitudinal data. In their conventional formulation as linear mixed models (LMMs) and generalized LMMs (GLMMs), a commonly indispensable assumption in settings involving longitudinal non-Gaussian data is that the longitudinal observations from subjects are conditionally independent, given subject-specific random effects. Although conventional Gaussian...
O6-Methylguanine-DNA Methyltransferase and ATP-Binding Cassette Membrane Transporter G2 Promotor Methylation: Can Predict the Response to Chemotherapy in Advanced Breast Cancer?
Background: The ATP-binding cassette membrane transporter G2 (ABCG2) gene is a member of a transporter family that is well characterized for its association with chemoresistance. Promoter methylation is a mechanism for regulating gene expression. The O6-methylguanine-DNA methyltransferase (MGMT) gene plays a fundamental role in DNA repair. MGMT has the ability to remove alkyl adducts from DNA at the O6 pos...